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Abstract. Shapely types separate data, represented by lists, from shape, 
or structure. This separation supports shape polymorphism, where oper¬ 
ations are defined for arbitrary shapes, and shapely operations, for which 
the shape of the result is determined by that of the input, permitting 
static shape checking. They include both arrays and the usual algebraic 
types (of trees, graphs, etc.), and are closed under the formation of initial 
algebras. 


1 Introduction 


Consider the operation map which applies a function to each element of a list. In 
existing functional languages, its type is 

(a —►/?)—*a list—►/? list 


where a and 0 may range over any types. This data polymorphism allows the 
data (a and 0) to vary, but uses a fixed shape, list. Shape polymorphism fixes 
the data, but allows the shape to vary, so that, for types A and B, instances of 
map include 

(A-*B)-*A tree— *B tree 


and 

(A-+B)—>A matrix— *B matrix 

In each case map(/) applies / to the data (the leaves or entries), while leaving 
the shape fixed. Typically, both kinds of polymorphism co-exist, so that map can 
vary both its data and its shape. 

Shape polymorphism applies to the usual algebraic types, (and others, such 
as matrices) but, being restricted to covariant constructions, does not address 
contravariant types of, say, functions. 

The appropriate class of types are the shapely types, whose shape and data 
can be separated. With only one kind of data, this separation is represented by 
a pullback 


Shap< 


arity 


N 
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Values of a shapely type FA are uniquely determined by a list of A’s and a 
shape of type Shapes, such that the arity of the shape equals the length of the 
list. Shape polymorphism arises when the operations on the data and shape are 
completely independent of each other. 

It oftens happens, particularly in scientific computation and data-processing, 
that the shape can influence the data, but not conversely. If computation is 
restricted to such shapely operations then the shape information can be treated 
as if it is part of the type system, with all shape computations, including shape 
checking, performed before executing the list operations. The latter can then be 
optimised, or run in parallel, without sacrificing (and indeed strengthening) the 
benefits of typing. 

These ideas can be understood within a semantics based on sets, or of bottom¬ 
less c.p.o’s, but for generality (and flexibility) are presented in a locos [Coc90], 
which provides a minimal setting for working with both lists and pullbacks. In 
particular, no higher types are assumed (as is shown feasible in the language 
Charity[CF92]) since they contribute nothing to the theory of shape. A calculus 
of shapely functors and natural transformations is introduced, with the shapely 
type constructors being those functors which have a shapely transformation to 
the list functor (or a multi-parameter analogue of it). 

Most of this paper is devoted to the introduction of concepts. The main 
result of the paper is that the shapely type constructors are closed under the 
construction of initial algebras. That is, once we have lists then we have all the 
other “algebraic” types of trees, graphs, queues, etc. The additional presence 
of arrays, and other types defined by pullbacks, means that the shapely types 
lie between these algebraic types, and the class of initial algebras for arbitrary 
functors. Full details of proofs can be found in [JC94]. 

2 Locoses 

The types and operations are modelled by the objects and arrows of a category 
C. It must have lists (and the underlying products and coproducts required to 
define them) and enough pullbacks to work with shapes. Specifying such a class 
of pullbacks (as was done for the Boolean categories of [Man92]) at this stage 
would impose an unwelcome burden so, to simplify slightly, we will assume that 
we have all pullbacks, and work in a finitely complete, distributive category (or 
lextensive category [CLW93]) which has all list objects, i.e. a locos [Coc90]. Being 
extensive is equivalent to requiring that all coproduct diagrams have disjoint 
(monomorphic) inclusions, and are stable under pulling back. Examples include 
the usual semantic categories, including those of sets, bottomless c.p.o.’s, or even 
topological spaces, any one of which will suffice to illustrate the ideas below. 

Let us fix some notation. If / : C—>A and g : C-+B are morphisms then 
if, 9) ■ C^AxB is their pairing. The left and right projections from the product 
are n A ,B and F AB (respectively) and the unique morphism to the terminal object 
is U : A—* 1. The symmetry for the product is denoted c AiB : AxB^BxA. 
Dually, the coproduct inclusions are given by t AjB : A—>A+B and i' A B : B—rA + 
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B. If / : A—*C and g : B-^C then their case analysis is given by [/, g\ : A + 
£?—»-C. The functors 77, B : C n —+C denote chosen n-fold products and coproducts, 
respectively, and A : C—>C n is the diagonal functor. 

The distributive law is witnessed by a natural isomorphism 

d A ,B,c :Ax(B + C)—>(AxB) + {AxC) 
whose inverse is [idxt, idxt']. 

Subscripts on natural transformations will be omitted unless required to dis¬ 
ambiguate an expression. 

A pullback is a commuting square 



such that, for every pair of morphism x : X—>A and y : X—*B such that f o x = 
goy there is a unique morphism z : X—*P such that po z = x and q o z = y. We 
denote z by ( x,y) (the usual pairing is a special case). 

The list constructor is a functor L : C-+C. Its basic operations are denoted 


nil: l^LA (1) 

cons : AxLA-+LA (2) 

loldr(*, h) : LA-+C (3) 


where x : B—*C and h : AxC-+C are morphisms. nil and cons are the usual 
constructors, while ioldr(x, h) is the unique morphism making the following 
diagram commute 



It follows that [nil, cons] : l + (AxLA)^LA is an isomorphism, which expresses 
LA as a. coproduct. 




From these primitives we can construct the usual family of list operations, 
whose notation is a mixture of the list notation of [BW88] and categorical nota¬ 
tion for monads: 


Lf : LA^LB 
ri : A-*LA 
@ : LAxLA^LA 
snoc : LAxA—>LA 
H : L 2 A-+LA 
g* : LA^LB 


is map(/) for / : A-^B 
makes singleton lists 
is append 

is cons on the tail of the list 
flattens a list of lists 
is // o Lg for g : A—*LB. 


Many elementary results about lists in locoses can be found in [Jay93b], 

LI is a natural numbers object N with zero 0 and successor S given by nil 
and cons respectively. Then r] = one and @ = + is addition and p : LN—+N 
is summation. Let Eq be the equality on N. The length of a list object LA is 
L! = # : LA^N. 

Define shunt : LAxLA-^LAxLA to be 

LAxLAZ LA + (LAxAxLA) s»oc»<id] IAx£A 


where the isomorphism is given by the coproduct decomposition of LA and the 
distributive law. Then 

split : NxLA-^LAxLA 

is given by foldr((nil, id), shunt). It divides a list into two segments, whose 
first, initial segment, has length given by the first projection (if the list is long 
enough). Define 


take = 7ro split : N xLA—>LA 
drop = tt' o split: N xLA—*LA . 
Lemma 1. @ o split = k' . Hence, we have a pullback 
(#° 


LCxLC - 

l-J 


► NxLC 


o (tt, # o take) 


(4) 

(5) 


Proof. Both sides of the equation equal foldr(id, id). 

Given x : X—*NxLC for which Eqo (n-, take) o x = true then the induced 
morphism into the pullback is split ox. □ 

The powers C n of C are also locoses, and they have a right C-action. That is, 
a functor C n x C—>C n which maps A — (A \, A 2 ,..., A n ) and B to 

AxB = (A 1 xB,A 2 xB,. ,.,A n xB) 
and has the obvious action on morphisms. 
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3 Shapely Functors and Transformations 


The essential ideas of this section can be found in [Jay93a]. 

A strength for a functor F : C m —*C is a natural transformation 

t a ,b : FAxB—*F(AxB) 

which satisfies the usual associativity and unicity axioms. More generally, a 
strength for F : C m —>C n is given by a strength for each of its projections onto 
C. See [CS92] for an account of the connections between strength and fibrations. 

(F, r) is a shapely functor if F is stable (preserves all pullbacks) and r is a 
strength for it. Then FI is the object of F-shapes and # = F\ : FA~*F 1 is 
the shape of FA (generalising the length of a list). As F is stable, the following 
diagram is a pullback 

F(AxB) FB 

Fir # 

FA --—* FI 

# 

If FAx#FB is a canonical choice of pullback (representing pairs that have the 
same shape) then there is an isomorphism 

zip : FAx#FB^F(AxB) 

which is inverse to (Fir, Fir') and generalises the usual zip of lists. 

The shapely functors include the product, coproduct and list functors, and 
are closed under composition and pairing. 

A natural transformation a : F=>G is cartesian if, for every morphism f : 
A—+B, the following square is a pullback 


Ff 


* GA 
Gf 
- GB 


A strong natural transformation (F, n)=>-(G, rf) between strong functors is 
a natural transformation a : F=>G that commutes with the strengths, i.e. 


FAxB 


Q^xid 


GAxB 


F(AxB) - 


- G(AxB) 


a AXB 
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If, further, F and G are shapely and a is cartesian, then a is a shapely transfor¬ 
mation and F is shapely over G by a. 

A consequence of being an extensive category is that the coproduct inclusions 
are shapely; cartesian-ness is by definition, and the strength is given by the 
distributive law. 

Projections from the product x : C 2 —*C, though strong, are never shapely. 
(If it were so then all the transformations of interest would be shapely.) Instead, 
given / : A-^C and g : B—*D we have the pullback 


AxD - 

|J 


/xidj 
CxD - 


which shows that the transformation 7r : (—)x£)=>id : C-+C is cartesian. Hence, 
7 ta,b is shapely in A. 

The shapely transformations are closed under vertical and horizontal com¬ 
position, so that for each locos C we have a 2-category of shapely functors and 
natural transformations. They are also closed under pairing and case analysis. 
Here are two more general results. 

Proposition 2. Let F : C 2 —*C be a shapely functor and let G, H : C-+C be any 
functors. Suppose that for each object B the transformation cxa,b ■ F(A, B)—+GA 
is cartesian in A. Similarly, suppose that for each object A the transformation 
Pa,b ■ F(A, B)—*HB is cartesian in B. Then 

( a A,B>S a,b) : F(A,B)^GAxHB 

is a cartesian in both A and B. 

Proof. Consider a commuting square 

X- GAxHB 

z GgxHh 

F(A',B') — — GA'xHB' 

{<*, 0 ) 

Then x and z induce a unique morphism x' : X—>F(A, B') by the cartesian-ness 
of a. Similarly y and 2 induce a morphism t/ : X^F{A', B ). As F is shapely, 
x‘ and if induce the desired morphism X^>F(A, B). □ 
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Theorem 3. If a : F=>G and /? : HxG=>G are shapely transformations, then 
7 A = fol dr(a A ,0A) : LHAxFA-*GA 
is a shapely transformation. 

Proof. The result generalises [Jay93a, Theorem 2.6] without changing the proof. 

□ 

Corollary 4. @ = f oldr(nil, cons) and fi = foldr(nil, @) are shapely. □ 

The following lemma shows how strength and cartesian-ness interact. 

Lemma 5. If a : F=^G is cartesian and (G, T2) is shapely then there is a unique 
strength T\ for F such that (F, Ti) and a are both shapely. 

Proof. If F has a strength ri that makes a strong then the following diagram 
must commute. 

FAxB aXld > GAxB 



Since the square is a pullback, this determines ri uniquely. Conversely, this 
pullback can be used to define 77. Its naturality, associativity and unicity are all 
inherited from that of r^. □ 


4 Shapely Types 


A functor F : C—>C is a shapely type constructor if it is shapely over L. More 
generally, a functor F : C m —>C n is a shapely type constructor if it is shapely over 
AIIL m . Equivalently, each of the projections of F onto C must be shapely over 
IlL m . If 6 is the relevant shapely transformation then we have the pullback 


F(Ai)- 

I -I 


- n(LAi) 

I# 


FI 


N’ 
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F(A ,) is a (tuple of) shapely types, and Si is also known as the arity of FI. 

The shapes can be thought of as having fixed numbers of holes or entries of 
each type, which are filled in by the data. 

Of course, lists are shapely by the identity transformation. Also, // : L 2 =>L 
makes L 2 a shapely type constructor, with shapes given by lists of numbers. 

The shape of a matrix is given by its dimensions, which are of type N 2 . The 
defining pullback is 



N 2 -- N 


where MA is the type of matrices with entries from A. (They can also be defined 
as vectors of vectors [Jay93a]. Matrix multiplication, and general operations of 
linear algebra can all be defined in this way. 

Binary trees with leaves labelled by A’s have shapes given by (unlabelled) 
trees, whose arity is their number of leaves, and data given by their list of leaves. 



leaves 

The pullback construction does not determine the order of elements in the list 
of leaves, which could be listed from left to right, right to left, or in some more 
arcane fashion. Let us stick with left-to-right order, unless specifically changed, 
map on trees is given by the induced morphism into the pullback 



Note that the shape has remained fixed while the data changes. Of course, this 
construction generalises to define map for any shapely type constructor. 
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Other operations change the shape and leave the data fixed. For example, 
the balancing of a binary tree is given by an operation bal^ : TA-^TA which 
is independent of the labels 


TA -- LA 



leaves 


Finally, both data and shape may change together, as occurs with the node 
operation, which creates a tree from two subtrees. 



leaves 

The number of leaves in the result is the sum of those in the sub-trees, while the 
lists of leaves must be appended. Note that if leaves represents the leaves from 
right to left then the order of the arguments must be swapped to define @. In 
some sense, the choice of @ here fixes the representation of the leaves. 

Trees whose nodes are also labelled ar shapely, too. The defining pullback is 



T1 — 7 -r- NxN 

(leaves,nodes) 

Most of the usual first-order data types of functional languages can be con¬ 
structed in this way, and arrays are available within the same framework, too. 
Note that, since shapely type constructors are always covariant functors, con- 
travariant constructions, such as function types, cannot be shapely. 
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4.1 An Alternative Approach 

In defining shapely type constructors, the data is given by a tuple of lists, one for 
each kind of data whose type is LA\xLA 2 x ... xLA„. What happens if these 
lists are interleaved to resemble input strings, of type L(Ai + A 2 + ... + A n )? In 
other words, one could consider the functors which are shapely over LS instead 
of IIL. In fact, the resulting class of functors is unchanged, as shown by the 
following 

Proposition6. II L m and LI 7 are each shapely over the other. 

Proof. Clearly, there is shapely natural transformation IIL m ^LS obtained by 
concatenation. Conversely, define the natural transformation check.45 by 

(A + B)xLA — {AxLA) + ( BxLA ) LA 

It is shapely in A whence = f oldr(nil, check) is, too. This can be generalised 
to define : L{A\ + A 2 + ... + A„)—>LAi which strips from a list all entries 
which are not from A,-. It is shapely in A{. The obvious n-fold generalisation of 
Proposition 2 shows that 

k = («,•) : L2=>IIL m 

is a shapely transformation. □ 

5 Shape Polymorphism 

Shape polymorphism arises when the operations on the shape and on the data 
are independent of each other. Currently, such operations must redefined for 
each new type, increasing the bulk of the code and reducing clarity. Aside from 
map, the shape polymorphic operations include 


zip : FAx#FB^-F(AxB) 

(6) 

r : FAxB—*F{AxB) 

(7) 

t' : AxFB^F{AxB) 

(8) 

t" = Ft'ot: FAxFB^F 2 (AxB) 

(9) 

copies : FlxA~+FA 

(10) 

square = r 0 {#, id) : FA—>F 2 A . 

(11) 


t' is dual to t and the operation copies instantiates all entries of the shape to 
the given value, while square replaces each entry with a copy of the whole. 

Closely related to map are the pointwise operators introduced by example in 
[Jon90] and defined in [Jay93a], These iterate an endomorphism at each entry 
in a shape. The number of iterations at each entry is determined by a weight on 
the shape i.e. a morphism FI— >FN. Particular shapes may have special weights 
(e.g. one can weight each leaf in a tree by its depth) but weights on lists yield 
shape polymorphic operations. Examples include weighting each entry by the 
length of the list, or by its position. Their use in defining the discrete Fourier 
transform op. cit. shows it to be shape polymorphic. 
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6 Shapely Operations 


In general computation, the shape of the result is influenced by that of the data. 
Examples include filtering of a list, or graph reduction. There is, however, a 
large body of computations where the shape of the result depends only on that 
of the input, without reference to the data. As well as the shape polymorphic 
operations, these include many operations where the shape affects the data, but 
not conversely. Examples include averaging of entries, pointwise operators, and 
many algorithms, such as the DFT. 

For such shapely operations it is profitable to separate the internal repre¬ 
sentations of shape and data. Then shape processing can be treated as part of 
compilation, in which shape errors are detected (e.g. attempting to zip two dif¬ 
ferent shapes), the shape of the result is computed, and any shape information 
required by the data is supplied. Data can then be stored and processed in ar¬ 
rays. In this way, the clarity of type (and shape! )-checking is combined with the 
efficiency of array-processing. 

The main points are illustrated by the decomposition of a tree into either a 
leaf or a pair of sub-trees. 


TA —=— A + ( TA) 2 — - t£ A + (LA) 2 LA 

l-J J 


T1 -- 1 + (Tl) 2 —-1 + N‘ 


id + 6 2 [one, +] 


The shape of the result is determined by that of the input, but in order to 
know where to break the list of leaves, the number n of leaves in the left sub¬ 
tree is required. Shape processing would add the computed value of n to the 
environment prior to the array-processing. 


7 Initial Algebras 

7.1 Endofunctors 


Let F be shapely over L. The initial algebra Fq will be constructed as an object 
of well-formed expressions. The alphabet is given by 12 = FI. The well-formed 
expressions are described by a pullback 



(nil, one) 


LfixN 



where Xi attempts to recognise the well-formed expressions. More precisely, its 
second component counts the number of well-formed expressions created, while 
the first component yields that part of the input string (if any) which could not 
be recognised. 

Observe that ! : FI —>1 makes 1 an i^-algebra. The recogniser xi is a special 
case of a more general “parser” 

Xc ■ LQ^LQxLC 

to be defined for any F-algebra 7 : FC—>C. The terminology is motivated by 
the case when C represents the parse trees, as given by Fq below. 

Xc = f oldr((nil, nil), 9c) where &c : QxLQxLC^LQxLC performs one 
step of the parse, as will now be described. 

First, the middle component of the source is nil unless the parse has already 
failed. That is, we can re-express the source (using the appropriate isomorphism) 
as (I2xLC)+(I2xI2xLI2xLC) and then &c = [Cc, conso(idxcons)xid] where 
Cc remains to be described, by cases. 

Lemma 7. The test Eqo (n t take) o (6xid) : fixLC—> bool recognises the sub¬ 
object 

(idx@) o ((#,6)xid) : FCxLC^IixLC . 

Proof. In brief, the test picks out those pairs where the arity of the Q is no 
greater than the length of the list. In that case, we have the resources to construct 
something of type FC with a list of C’s left over. □ 

Let 4' : QC-^QxLC be the pullback of the above test along false. Then Cc 
is given by decomposing fixLC into the specified coproduct followed by 

[(nil, cons o( 7 Xid)),(j 7 Xid)ot'] : (FCxLC) + QC-^LQxLC . 
Lemma 8. If h \ (C,y)—*(C',Y) is an F -algebra homomorphism then 
(idxLh) o xc = XD ■ 

Proof. Clearly Q is a functor and 4 ' is a natural transformation. Hence C, 0 and 
X are natural with respect to F 1 -algebra homomorphisms. □ 

Hence, the construction of Fq can be given in stages by 


(nil, one) 


LQxN 
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Now we will show that Fo is an initial ^-algebra with he the unique algebra 
homomorphism to C. 

Lemma9. = (nil, Lhc) : LF 0 —*LQxLC 

Proof. It suffices to show that both sides of the equation are foldright of (nil, nil) 
and foldr(id, 6c) o (0X id). The cons case for the right-hand-side is not trivial. 
See [JC94] for details. 

□ 


Consider the following cube. 


FFo - f2xLF 0 - ldX< ^ HxLfi 



Lemma 9 implies the commutativity of its rear face. The right and bottom faces 
commute by the definitions of \ and 6. Hence, there is an induced J^-action 70 
that makes he a homorphism. (Of course, the definition of 70 and its action is 
not dependent on the particular choice of C, since we can always work over the 
algebra C = 1.) 

It remains to prove its uniqueness. Let h : Fo~>C be any F 1 -algebra homo¬ 
morphism. Then the pullback defining he factorises it as h o ho as in Fig. 1. (We 
abuse notation by denoting hp 0 by h 0 and xf 0 by Xo etc.) Hence it suffices to 
prove that ho — id. 

The following lemma shows that parsing into Fo is reversible. 


Lemma 10. @ o (id x<f>*) 0x0 = id -Ln 


Hence 


<j>oh 0 = @0 (id x<j>*) o (nil, rf) o h 0 
= @o(idx0*)oxo°0 

= <t> 


and so h 0 = id since <j> is a monomorphism. 


( 12 ) 

(13) 

(14) 
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F 0 — 

J 


Xo 

LQxLFa 
id xLh 

► LQxLC 


(nil, v) 

Fig. 1. Factorisation of he 


7.2 The General Case 

A functor F : C m xC n —>C n can be used to represent a system of (parametrised) 
domain equations [SP82], whose solution is can be found by constructing, for 
each object A in C m , an initial algebra a A : F(A,F^A)—^F^A for the functor 
F(A, —). If such always exist, then ft extends to a functor whose action on 
/ : A-+B is the F(A, —)-algebra homomorphism induced by the action 

F{A,F*B) F{B,F'B) F*B 

Further, if (3 : F(id, G)=>G : C m —>C n is a natural transformation, then the 
unique algebra homorphisms induce a natural transformation ft : F^=^G. 

Theorem 11. If <j> : F=>AIIL m+n : C m xC n —*C n is a shapely type constructor 
then F 1 exists and is one, too. Further, if /? : F(id, G)=>G : C m —>C n is a shapely 
transformation, then so is ft. 

Proof. F is determined by its projections onto C which are all shapely over 
IIpm+n gy thg ggkic Lemma, we can treat these individually, or, equivalently, 
assume that n = 1. Then for each object A in C m the initial algebra F^A for 
F{A, —) is constructed as above. 

The cartesian-ness of ft follows from that of Xa which, by Theorem 3, re¬ 
duces to that of Oq. Examination of the cases reduces this to the cartesian-ness 
of cons, r] and f3. 

The strength for F^ is defined using the defining pullback for F^(AxB) and 
the strength of LF(A, 1 )xLGA. It follows that ft is shapely. 

Now, taking /? to be 

<b /7@ 

F(A,nL m A ) IIL m AxFIL m A = II{LAx LA) -- IIL m A 

(where = permutes the arguments) induces a shapely transformation F^=^II L rn 
which shows that F^ is a shapely type constructor. □ 
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8 Related and Further Work 

Banger and Skillicorn [BS93] give a categorical semantics for arrays, which are 
represented by their dimensions and some unbounded lists. They do not use 
pullbacks to represent their types, nor is there a general theory of shape. 

Those shapely types, such as matrices, which are not part of the usual func¬ 
tional programming approach, can be represented using dependent types, but 
their type checking must be performed dynamically, whereas most shape com¬ 
putation should be performed statically. The exact relationship to dependent 
types, and the internal logic of locoses, require further exploration. 

Further work will consider how to incorporate shape ideas into program¬ 
ming language design, and compiler construction, using classes in a functional 
or object-oriented setting. 
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